智能论文笔记

Visualizing Information Bottleneck through Variational Inference

Cipta Herwana , Abhishek Kadian

分类：机器学习

2022-12-24

The Information Bottleneck theory provides a theoretical and computational framework for finding approximate minimum sufficient statistics. Analysis of the Stochastic Gradient Descent (SGD) training of a neural network on a toy problem has shown the existence of two phases, fitting and compression. In this work, we analyze the SGD training process of a Deep Neural Network on MNIST classification and confirm the existence of two phases of SGD training. We also propose a setup for estimating the mutual information for a Deep Neural Network through Variational Inference.

translated by 谷歌翻译

相关文章
笔记